Genome-wide association studies have shown that the rs340874
single nucleotide polymorphism (SNP) in PROX1 is a genetic 
susceptibility factor for type 2 diabetes. We conducted genetic 
and molecular studies to better understand the role of PROX1 in 
type 2 diabetes. We assessed the impact of the whole common 
genetic variability of PROX1 (80 SNPs) on type 2 diabetes- 
related biochemical traits in the HELENA (Healthy Lifestyle 
in Europe by Nutrition in Adolescence) study (n = 1,155). Three 
SNPs (rs340838, rs340837, and rs340836) were significantly as- 
sociated with fasting plasma insulin levels (P < 0.00295). We 
evaluated the impact of nine PROX1 SNPs (the three insulin- 
associated SNPs plus six SNPs in strong linkage disequilibrium) 
on luciferase reporter gene expression. The insulin-lowering 
alleles of rs340874, rs340873, and rs340835 were associated with 
lower luciferase activity in MIN6 and HepG2 cells (except for 
rs340874, which was in HepG2 cells only). Electrophoretic mo- 
bility shift assays indicated that specific nuclear protein bind- 
ings occur at the three SNPs in HepG2 cells, with allele-binding 
differences for rs340874. We also showed that the knockdown 
of Prox1 expression by small interfering RNAs in INS-1E cells
resulted in a 1.7-fold reduction in glucose-stimulated insulin
secretion. Altogether, we propose that reduced expression of
PROX1 by cis-regulatory variants results in altered β-cell insulin
secretion and thereby confers susceptibility to type 2 diabetes.
Type 2 diabetes is a major public health issue that
will affect >439 million people worldwide by
2030 (1) and is characterized by impaired β-cell
function and insulin resistance, leading to
chronic hyperglycemia (2). Hyperglycemia is the main
cause of microvascular and macrovascular complications
(3). Type 2 diabetes is a complex metabolic disease in
which both environmental and genetic factors are involved.
To date, ~40 genetic loci have been found to be associated
with type 2 diabetes (4).

The Meta-Analysis of Glucose and Insulin-Related Trait 
Consortium (MAGIC) combined the results of 21 genome- 
wide association studies (GWAS) in terms of fasting glucose 
(FG), fasting insulin (FI), and indices of β-cell function
(homeostasis model assessment [HOMA]-B) and insulin 
resistance (HOMA-IR) in 46,186 nondiabetic subjects (5). 
The authors reported 2 loci associated with FI and HOMA- 
IR and 16 associated with FG and HOMA-B, including the 
rs340874 single nucleotide polymorphism (SNP) in the 
prospero homeobox 1 (PROX1) gene. They also showed 
that carriers of the glucose-raising C allele for the rs340874 
SNP display a higher risk of type 2 diabetes. Two studies 
with oral glucose tolerance tests showed that the C allele 
of rs340874 is associated with reduced insulin sensitiv- 
ity and secretion (6,7). In children and adolescents, the 
meta-analysis by Barker et al. (8) showed that rs340874
is associated with FG level and, to a lesser extent,
HOMA-B, with effect sizes similar to those observed
in adults.

In humans, PROX1 is located on chromosome 1 and was 
first identified in mice, thanks to its homology with the 
Drosophila homeobox protein prospero (9). PROX1 encodes
a key transcription factor (TF) involved in the development
of tissues, such as endothelial lymphatic
vessels, liver, retina, and pancreas (10,11). PROX1
seems to be expressed during the specification and
proliferation of pancreatic progenitor cells (11). Indeed, the
lack of Prox1 activity prevents pancreas development and
affects the organ's cellular structure in mice (11). However,
the link between PROX1 and type 2 diabetes in
humans has not been established to date.

Detailed characterization of PROX1 genetic variability 
could help to elucidate the role of PROX1 in type 2 di- 
abetes and to identify potential type 2 diabetes disease 
pathways. In contrast to previous GWAS focusing on the 
top hit in PROX1 (rs340874), we assessed the impact of the 
whole genetic variability of PROX1 (80 SNPs) on type 2 
diabetes-related traits in adolescents. Genetic studies in 
children and adolescents are less susceptible to con- 
founding environmental factors because of the subjects' 
young age. We also evaluated the functional impact of
SNPs of interest on reporter gene expression in mouse
pancreatic β-cells (MIN6) and human hepatocytes
(HepG2). Finally, we evaluated the influence of Prox1 on
glucose-stimulated insulin secretion (GSIS) in rat pancreatic
β-cells (INS-1E).

RESEARCH DESIGN AND METHODS 

The HELENA study. The recruitment and phenotyping of participants in the 
Healthy Lifestyle in Europe by Nutrition in Adolescence (HELENA) cross- 
sectional study (www.helenastudy.com) have been described previously (12). 
A total of 3,865 adolescents (12-18 years of age) were recruited between 2006
and 2007 from nine European countries. Adolescents were randomly selected 
from schools by proportional cluster sampling, taking age into account. One- 
third of the classes were randomly selected for blood collection, resulting in 
1,155 samples. Data were collected on a detailed case report form and in 
accordance with standardized procedures. The protocol was approved by the 
appropriate investigational review board for each center. Written informed 
consent was obtained from each adolescent and both of his or her parents or 
legal representatives. Participation in the study was voluntary (13). 

Venous blood samples were drawn after a 10-h overnight fast and sent to 
a central laboratory (IEL, Bonn, Germany) in accordance with standardized 
protocols (14). Serum triglyceride, total cholesterol, HDL cholesterol, LDL 
cholesterol, and glucose levels were enzymatically assayed on the Dimension 
RxL clinical chemistry system (Dade Behring, Schwalbach, Germany). Insulin 
was measured with an IMMULITE 2000 analyzer (DPC Biermann GmbH, Bad 
Nauheim, Germany). DNA was extracted from white blood cells with the 
Gentra Puregene Cell Kit (QIAGEN, Courtaboeuf, France). 

Anthropometric measurements were strictly monitored, with participants 
barefoot and wearing only underwear. Weight (seca 861 electronic scale) and 
height (seca 225 height rod) were measured and BMI calculated. Waist and hip 
circumferences were measured with a nonelastic measuring tape (seca 200). 
Percentage of body fat was estimated from skinfold measurements (15,16). 
Gene SNP selection and genotyping. Using the HapMap database (release 28,
August 2010) and applying a minor allele frequency (MAF) >0.025 and r² > 0.80 in
Haploview (17), the PROX1 region (chromosome 1: 212,223,454-212,281,411,
NCBI build 36) comprised six haplotype blocks (block 1 rs366684/rs3767844/
rs3754138/rs4282786/rs3754140/rs3767848/rs446175/rs4655480/rs726334, block 2
rs11120242/rs12089523/rs12081352/rs6686424/rs12092859, block 3 rs10494972/
rs7543057, block 4 rs4655313/rs4655314, block 5 rs340835/rs340839, block 6
rs340837/rs340873) and six independent SNPs. We selected one tag SNP from
each block (rs3754138, rs12092859, rs10494972, rs4655313, rs340835, and
rs340837) and the six independent SNPs (rs11802122, rs2289002, rs340877,
rs4655482, rs340874, and rs12748973) to cover the whole of PROX1. In total, 12
SNPs were genotyped using KASPar technology (KBioscience, Hoddesdon,
U.K.) (Supplementary Table 1). The genotyping success rate was between 97.9
and 99.7%. Two SNPs (rs12092859 and rs10494972) deviated from
Hardy-Weinberg equilibrium (Supplementary Table 1) and, therefore, were not used
for the imputation analyses.

Imputation. SNP data were mapped to the National Center for Biotechnology
Information (NCBI) build 37 coordinates (assembly hg19) to match the genomic
positions used by the 1000 Genomes European reference panel. IMPUTE2
software was used to phase the observed genotypes and impute missing
genotypes in the PROX1 region (chromosome 1: 214,156,831-214,214,788,
NCBI build 37 coordinates, assembly hg19) (18). We increased the number of
hidden Markov model states for phasing the study dataset from 80 to 200 for 
higher accuracy and increased the number of hidden Markov model states for 
imputation to take into account all the reference diplotypes in the reference 
panel. The 1000 Genomes version 2 (March 2012) of the 20110521 release was 
used. The reference panel comprised 379 reference haplotypes of the Euro- 
pean ancestry. Only SNPs with an MAF >0.025 in this reference panel were 
considered (n = 100). Imputation quality control was assured by looking at 
the information measure that represents the measure of the observed statis- 
tical information associated with the allele frequency estimate. All SNPs with 
an information measure >0.80 were included for the association analyses. 
From the 10 genotyped PROX1 SNPs, we imputed 70 other SNPs in the PROX1 
region (see linkage disequilibrium [LD] pattern in Supplementary Figure 1). 
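
The two inclusion thresholds described in this section (MAF > 0.025 and information measure > 0.80) can be illustrated with a minimal Python sketch. The record layout, field names, and example values are hypothetical and are used only for illustration; IMPUTE2 reports these quantities in its own output files, which are not parsed here, and in the study the MAF filter was applied to the reference panel.

```python
from dataclasses import dataclass

MAF_MIN = 0.025   # MAF threshold stated in the text
INFO_MIN = 0.80   # information-measure threshold stated in the text


@dataclass
class ImputedSnp:
    snp_id: str         # hypothetical identifier
    allele_freq: float  # frequency of the coded allele (hypothetical field)
    info: float         # imputation information measure (hypothetical field)


def minor_allele_frequency(allele_freq: float) -> float:
    """Fold an allele frequency onto the minor allele."""
    return min(allele_freq, 1.0 - allele_freq)


def passes_qc(snp: ImputedSnp) -> bool:
    """Keep a SNP only if both thresholds are exceeded."""
    return minor_allele_frequency(snp.allele_freq) > MAF_MIN and snp.info > INFO_MIN


def filter_snps(snps):
    return [s for s in snps if passes_qc(s)]


# Made-up records: one passes, one is too rare, one is poorly imputed.
example = [
    ImputedSnp("snp_a", 0.30, 0.95),
    ImputedSnp("snp_b", 0.99, 0.97),
    ImputedSnp("snp_c", 0.40, 0.60),
]
kept = [s.snp_id for s in filter_snps(example)]
```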
Statistical analysis. Departure from Hardy-Weinberg equilibrium within the
study groups was tested using the χ² test. Statistical analyses were performed
with SNPTEST software, using the genotype probabilities to take into account
genotype uncertainty (19).
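
As a minimal sketch of the Hardy-Weinberg check, the following Python function computes the χ² goodness-of-fit statistic (1 degree of freedom) from genotype counts. It is an illustration under stated assumptions, not the software used in the study; the function name and the example counts are hypothetical.

```python
import math


def hwe_chi2(n_aa: int, n_ab: int, n_bb: int):
    """Chi-square test (1 df) for departure from Hardy-Weinberg equilibrium.

    Returns (chi2, p_value) for counts of the three genotypes of a
    biallelic SNP.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotype is required")
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of the chi-square distribution with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical counts: the first set is in perfect equilibrium,
# the second has no heterozygotes at all.
chi2_ok, p_ok = hwe_chi2(25, 50, 25)
chi2_bad, p_bad = hwe_chi2(50, 0, 50)
```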

To obtain normal distributions, a log-transformation was applied for plasma 
triglyceride and insulin levels, HOMA-IR, and HOMA-B. An additive multivariate 
generalized linear model with a missing data likelihood score test was used for 
each trait. The covariates were age, sex, BMI, and study center. 
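
The derived indices and the log-transformation can be sketched as follows. The text does not state which HOMA equations were used; this sketch assumes the widely used original formulas, HOMA-IR = FG × FI / 22.5 and HOMA-B = 20 × FI / (FG - 3.5), with FG in mmol/L and FI in μIU/mL. Function names and input values are illustrative.

```python
import math


def homa_ir(glucose_mmol_l: float, insulin_uiu_ml: float) -> float:
    """Insulin resistance index (assumed formula: FG x FI / 22.5)."""
    return glucose_mmol_l * insulin_uiu_ml / 22.5


def homa_b(glucose_mmol_l: float, insulin_uiu_ml: float) -> float:
    """Beta-cell function index (assumed formula: 20 x FI / (FG - 3.5))."""
    if glucose_mmol_l <= 3.5:
        raise ValueError("HOMA-B is undefined for glucose <= 3.5 mmol/L")
    return 20.0 * insulin_uiu_ml / (glucose_mmol_l - 3.5)


def log_transform(value: float) -> float:
    """Natural log, applied to skewed traits before regression."""
    if value <= 0:
        raise ValueError("log-transformation requires a positive value")
    return math.log(value)


# Illustrative inputs only (not individual-level study data).
ir = homa_ir(5.0, 10.0)    # 5.0 * 10.0 / 22.5
b = homa_b(5.0, 10.0)      # 20 * 10.0 / 1.5
log_ir = log_transform(ir)
```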

Results with P < 0.05 were considered nominally significant. Correction for
multiple testing was applied by dividing 0.05 by the number of independent
SNPs (assessed with the SNPtagger tool in Haploview 4.2
[www.broad.mit.edu/mpg/haploview] with an r² threshold of 0.80). Seventeen tag SNPs were
identified (Table 1), and therefore, results with P ≤ 0.00295 were considered
statistically significant. Power calculations were performed using Quanto
version 1.2.4 (20). For molecular experiments, Mann-Whitney nonparametric
tests were performed with SAS statistical software (SAS Institute Inc., Cary,
NC).
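
The multiple-testing correction described above amounts to a Bonferroni-type division of 0.05 by the 17 tag SNPs, which gives about 0.0029, close to the cut-off of 0.00295 quoted in the text. A minimal sketch follows; the function name is hypothetical, and the P values are a few FI results taken from Table 3 for illustration.

```python
ALPHA = 0.05      # nominal significance level stated in the text
N_TAG_SNPS = 17   # number of independent (tag) SNPs stated in the text


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold for n_tests independent tests."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


threshold = bonferroni_threshold(ALPHA, N_TAG_SNPS)

# FI association P values for a few SNPs, as listed in Table 3.
fi_p_values = {
    "rs340837": 2.1e-3,
    "rs340836": 2.7e-3,
    "rs340873": 4.3e-3,
    "rs340874": 0.096,
}
significant = sorted(snp for snp, p in fi_p_values.items() if p <= threshold)
```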

Cell culture. MIN6 cells (a gift from B. Staels, Lille, France) were cultured in
Dulbecco's modified Eagle's medium-GlutaMAX medium supplemented with
10% FCS (Life Technologies, Villebon-sur-Yvette, France), 2 mmol/L L-glutamine
(Life Technologies), 100 μmol/L β-mercaptoethanol (Sigma,
Saint-Quentin-Fallavier, France), and 100 units/mL penicillin/streptomycin (Life
Technologies) at 37°C and 5% CO₂. INS-1E cells (a gift from B. Staels) were
cultured in RPMI medium supplemented with 10% FCS, 1 mmol/L Na pyruvate
(Life Technologies), 100 μmol/L β-mercaptoethanol, and 100 units/mL gentamicin
(Life Technologies) at 37°C and 5% CO₂. HepG2 cells were purchased



TABLE 1

Identification of the tag SNPs among the 70 imputed and 10 genotyped SNPs using SNP tagger (Haploview software) in the 1000
Genomes European reference panel (March 2012 release)

Block  Tag SNP       Alleles captured
1      rs3754138     rs366684, rs3767848, rs3754138, 1-214181765, rs7541039, rs3767847,
                     rs2360068, rs7517701, rs12073658, rs4655478, 1-214174348, 1-214173376,
                     rs2360069, 1-214181768, rs4655477, rs4655479, rs4282786, rs6665764,
                     rs4655480, 1-214175220, rs2884890, rs3767846, rs6657185, rs3767845,
                     rs726334, rs3767844, rs2289002, 1-214181767, rs446175, rs3754140
2      1-214201705   rs2241674, rs74139088, rs78736023, 1-214201705, rs428330, rs76339943,
                     rs399887, rs74929348, rs378414, rs4655482
3      rs183214928   rs7548045, rs76135924, rs78321699, rs75779359, rs7536256, rs116613999,
                     rs183214928, rs114885230
4      rs340838      rs340836, rs340837, rs340838, rs340873, rs340872
5      1-214189680   rs4100072, 1-214189680, rs72753599, rs430425, 1-214176994
6      rs10494972    rs113163499, rs10494972, rs74139093
7      rs340876      rs340877, rs340874, rs340876
9      rs235924      rs340878, rs235924
       rs919655
11     rs340839      rs340835, rs340875, rs340839
       rs11802122
13     rs114701953   rs114701953
       1-214168535
15     rs3767850     rs3767850
17     rs4655315     rs4655315



SNPs significantly associated with plasma insulin levels in the HELENA study are indicated in boldface. 
from ATCC-LGC Standards (Molsheim, France) and were cultured in minimum
essential medium (Life Technologies) supplemented with 10% FCS
(HyClone, Brebieres, France), 2 mmol/L L-glutamine, 100 units/mL
penicillin/streptomycin, 2 mmol/L minimum essential medium nonessential amino acids
(Life Technologies), and 1 mmol/L Na pyruvate at 37°C and 5% CO₂.
Plasmid transfections. MIN6 and HepG2 cells were grown to 60-70% confluence
in 24-well plates and were transfected with 500 ng of empty pGL4.23
[luc2/minP] vector (Promega, Charbonnieres, France) or pGL4-minP-PROX1
vector with either Lipofectamine 2000 (Life Technologies) using a
Lipofectamine:DNA ratio of 6:1 for MIN6 cells or FuGENE HD (Roche Applied
Science, Meylan, France) using a FuGENE:DNA ratio of 6:1 for HepG2 cells.
Luciferase activities were measured 48 h after transfection using the
Dual-Luciferase Reporter Assay kit (Promega). The firefly luciferase activity was
normalized to the Renilla luciferase activity obtained by cotransfection of 50
ng of the pGL4.74[hRluc/TK] Renilla Luciferase vector (Promega).
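
The normalization described above can be sketched in a few lines of Python: each well's firefly signal is divided by its Renilla signal, replicate ratios are averaged, and the construct is expressed as fold activity over the empty vector. The function names and the raw luminescence values are hypothetical and chosen only to make the arithmetic easy to follow.

```python
from statistics import mean


def normalized_activity(firefly: float, renilla: float) -> float:
    """Firefly signal corrected for transfection efficiency."""
    if renilla <= 0:
        raise ValueError("Renilla signal must be positive")
    return firefly / renilla


def fold_over_empty(construct_wells, empty_wells) -> float:
    """Mean normalized activity of a construct relative to the empty vector.

    Each argument is a list of (firefly, renilla) pairs, one per well.
    """
    construct = mean(normalized_activity(f, r) for f, r in construct_wells)
    empty = mean(normalized_activity(f, r) for f, r in empty_wells)
    return construct / empty


# Made-up luminescence readings (arbitrary units), three wells each.
empty_vector = [(1000.0, 500.0), (1200.0, 600.0), (900.0, 450.0)]
allele_g = [(6000.0, 500.0), (7200.0, 600.0), (5400.0, 450.0)]
allele_a = [(3000.0, 500.0), (3600.0, 600.0), (2700.0, 450.0)]

fold_g = fold_over_empty(allele_g, empty_vector)
fold_a = fold_over_empty(allele_a, empty_vector)
allelic_ratio = fold_g / fold_a   # how many times lower the A allele is
```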
Small interfering RNA transfections and insulin immunoassay. INS-1E
cells were grown in 6-well plates at 70% confluence and transfected with either
the ON-TARGETplus nontargeting pool small interfering RNAs (NT siRNAs)
(negative control) or a specific ON-TARGETplus siRNA SMARTpool against rat
Prox1 or a specific ON-TARGETplus siRNA pool against rat cyclophilin B (positive
control) (Dharmacon, Thermo Fisher Scientific, Lafayette, CO) using
DharmaFECT 2 according to the manufacturer's instructions. The optimum
transfection efficiency (>70% transfection, 75% inhibition of cyclophilin B)
was obtained using a cell density of 5 × 10⁶ cells/well, 120 nmol/L siRNAs, and
12 μL DharmaFECT 2 during 48 h (data not shown). After that, INS-1E cells
were incubated with 1 mL of Krebs buffer for 1 h and then with 3 or 30 mmol/L
glucose diluted in Krebs buffer for 1 h. The insulin concentration was measured
in the supernatant (diluted at 1:2,000) using an ultrasensitive rat insulin
ELISA kit (Mercodia, Uppsala, Sweden).

Glucose treatment on Prox1 expression. INS-1E cells were grown in 6-well
plates. When the cells reached 60-80% confluence, the culture medium was
changed to glucose-free RPMI medium supplemented with 3 mmol/L glucose
(Life Technologies) for 24 h. After that time, the medium was replaced with
glucose-free RPMI medium supplemented with either 3 or 30 mmol/L glucose
for 24 or 48 h.

Quantitative real-time PCR. Total RNA was extracted using the RNeasy Plus
Mini Kit (QIAGEN, Les Ulis, France). cDNA was prepared from 0.5 μg total
RNA using the SuperScript VILO cDNA Synthesis Kit (Life Technologies).
PCRs were performed using the Brilliant SYBR Green QPCR Master Mix and
the Mx3000P QPCR System (Agilent Technologies, Massy, France). Quantifications
were performed with TFIIB as the internal standard and presented as
fold change-over-control values. Primer sequences (forward and reverse,
respectively) were rat Prox1 (ACCCGTTACCCCAGCTCCAACATGC,
TTGATGGCTTGGCGCGCATACTTC), rat TFIIB (GTTCTGCTCCAACCTTTGCCT,
TGTGTAGCTGCCATCTGCACTT), and rat L-PK (liver pyruvate kinase)
(TGGACATCATCTTTGCCTCCTT, CTGCTAACACGTCACTGGCTTT).
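
The text reports fold change-over-control values normalized to TFIIB but does not state the calculation. One common way to obtain such values is the comparative Ct (ΔΔCt) method, sketched below as an assumption; it further assumes roughly 100% amplification efficiency for both primer pairs. The function name and Ct values are hypothetical.

```python
def fold_change_ddct(ct_target_treated: float, ct_ref_treated: float,
                     ct_target_control: float, ct_ref_control: float) -> float:
    """Relative expression by the comparative Ct method (assumed here).

    The target gene would be Prox1 and the reference gene TFIIB;
    amplification efficiency is assumed to be ~100% (doubling per cycle).
    """
    delta_treated = ct_target_treated - ct_ref_treated
    delta_control = ct_target_control - ct_ref_control
    delta_delta = delta_treated - delta_control
    return 2.0 ** (-delta_delta)


# Made-up Ct values: the target comes up one cycle later in treated cells
# while the reference is unchanged, i.e. half the control expression.
fold = fold_change_ddct(26.0, 20.0, 25.0, 20.0)
```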
Western blot analysis. Nuclear protein extracts were prepared using the
NE-PER Kit (Pierce, Rockford, IL). Forty micrograms of nuclear proteins were
separated by Novex NuPAGE 4-12% (Life Technologies) and transferred to
nitrocellulose membranes. The membranes were immunoblotted overnight
with rabbit polyclonal anti-PROX1 (ab38962 diluted 1:250 [Abcam, Cambridge,
U.K.]) or rabbit polyclonal anti-Ras-related nuclear protein (RAN) (ab53775
diluted 1:20,000 [Abcam]). Detection was achieved with horseradish
peroxidase-conjugated antirabbit IgG goat immunoglobulin (Bio-Rad, Marnes-la-Coquette,
France) at 1:10,000 for 1 h followed by enhanced chemiluminescence using the
ECL Western Blotting System detection kit (GE Healthcare, Velizy-Villacoublay,
France). The image was detected with a Chemidoc camera (Bio-Rad).
Electrophoretic mobility shift assays. Nuclear protein extracts from HepG2 
and MIN6 cells were prepared as just described. Probe sequences were 40 bp 
long, with each SNP allele at the middle. The forward oligonucleotide was 
labeled with digoxigenin (DIG) at the 3' end. Electrophoretic mobility shift 
assays (EMSAs) were performed using the DIG Gel Shift Kit, 2nd generation 
(Roche Applied Science) with 5 μg of protein extracts and 2 pmol DIG-labeled
probe. For competition EMSA, a 50-fold excess of unlabeled oligonucleotide 
probes was added to the reaction mixtures. Samples underwent electropho- 
resis on a 6% polyacrylamide gel and were transferred to a nylon membrane. 
After washes and blocking, the membrane was immunoblotted for 30 min with 
anti-DIG antibody diluted at 1:10,000 in the blocking solution at room tem- 
perature. Detection was achieved with the CDP-Star chemiluminescent sub- 
strate (Roche Applied Science) diluted at 1:1,000 in the detection buffer. The 
image was detected with a Chemidoc camera. 

RESULTS 

Association studies between SNPs and glucose-related
variables. Table 2 shows the clinical characteristics of the
HELENA study participants. The mean age was 14.7 ± 1.4
years, and 48% were boys. We tested the associations between
the 17 tag SNPs (3 genotyped and 14 imputed) and
anthropometric parameters, biochemical parameters, and
the HOMA-B and HOMA-IR indices. No significant associations
were detected with the anthropometric phenotypes
or with the plasma lipid variables (data not shown). In
contrast, a significant association was observed between
rs340838 and fasting plasma insulin levels (P = 2.95 × 10⁻³).
Carriers of the major T allele of rs340838 had lower
insulin levels than those homozygous for the minor allele
(β = -0.90 ± 0.32 μIU/mL). To better define this association
signal, all 80 SNPs were tested with plasma insulin
levels (Fig. 1). Two other SNPs (rs340837 and rs340836)
were significantly associated with plasma insulin levels,
with the best association detected for rs340837 (P = 2.1 ×
10⁻³) (Fig. 1 and Table 3). The rs340873 and rs340872 SNPs
also tended to be associated with insulin levels (P = 0.0043).
Consistently, the insulin-lowering alleles of the rs340838,
rs340837, and rs340836 SNPs tended to be associated with
lower HOMA-B (0.0060 < P < 0.0098) and lower HOMA-IR
(0.0054 < P < 0.0071) (Table 3). Of note, the GWAS top
hit SNP (rs340874) was not significantly associated with
plasma glucose level, insulin level, HOMA-B, or HOMA-IR
(P = 0.56, 0.096, 0.11, and 0.19, respectively) (Table 3), but
the data are in line with those of Barker et al. (8) because
the carriers of the rs340874 C glucose-raising allele had
lower HOMA-B than the carriers of the other allele (P =
0.11) (Table 3), especially in girls (P = 0.033) (data not
shown).

Luciferase assays. To identify the functional SNPs responsible
for the association with FI levels, we characterized
the functionality of the three insulin-associated SNPs
as well as of all the SNPs in strong LD (r² > 0.80) with
them (i.e., rs340875, rs340873, rs340872, rs340839, and
rs340835) (SNPs shown in red in Fig. 1 and in blocks 4 and
11 in Table 1). We also tested the functional impact of the
GWAS top hit SNP (rs340874). Because these nine SNPs
are located in the 5' region or in intron 1 of PROX1, they
may modulate PROX1 transcription. To investigate this
hypothesis, we subcloned oligonucleotides (60 bp) encompassing
the alleles of the nine SNPs into the pGL4-minP luciferase
reporter vector and assessed in vitro the
impact of these SNPs on luciferase activity using transient


TABLE 2

Characteristics of the participants in the HELENA study

Characteristic              Value
n                           1,155
Male/female (%)             48.0/52.0
Age (years)                 14.7 ± 1.4
Waist (cm)                  72.4 ± 8.7
Waist/hip ratio             0.8 ± 0.1
BMI (kg/m²)                 21.3 ± 3.8
Body fat (%)                23.8 ± 9.8
Glucose (mmol/L)            5.0 ± 0.4
Insulin (μIU/mL)            10.3 ± 7.7
Cholesterol (mmol/L)        4.2 ± 0.7
HDL cholesterol (mmol/L)    1.4 ± 0.3
LDL cholesterol (mmol/L)    2.4 ± 0.6
Triglyceride (mmol/L)       0.8 ± 0.4
HOMA-IR                     2.2 ± 1.5
HOMA-B

Data are mean ± SD unless otherwise indicated. 
FIG. 1. Degree of association between the 80 PROX1 SNPs and FI levels in the HELENA study. The P values (as -log10 values) of the associations
between the 10 genotyped (A.) or 70 imputed (#) SNPs and fasting plasma insulin levels are plotted as a function of genomic position (NCBI build
37) with LocusZoom. Estimated recombination rates (taken from the 1000 Genomes European reference panel) are plotted to reflect the local LD
structure around the associated SNPs and their correlated proxies (according to a blue-to-red scale from r² = 0 to 1 based on pairwise r² values
from the 1000 Genomes European reference panel and taking the rs340837 SNP [best hit] as the reference).



transfection assays in two different cell lines. MIN6 cells
were used as a model of pancreatic β-cells, and because
PROX1 plays a role in the morphogenesis of the liver (21), an organ
involved in glucose homeostasis, we also used HepG2
cells.

Among the nine SNPs, three presented significant allelic
expression differences (Fig. 2). The rs340873 A allele was
associated with a 1.5-fold (P = 0.004) and twofold (P =
0.02) lower luciferase activity in MIN6 and HepG2 cells,
respectively. The rs340835 A allele was associated with
a 3.4-fold (P = 0.05) and 4.2-fold (P = 0.02) lower luciferase
activity in MIN6 and HepG2 cells, respectively. In addition,
the rs340874 C allele was associated with a 1.65-fold (P =
0.05) lower luciferase activity in HepG2 cells only. These
results suggest that PROX1 expression may be lower in
the pancreas and liver of individuals carrying the
insulin-lowering alleles.

EMSAs. To investigate whether the differences in luciferase
activities observed were a result of differential TF
binding to PROX1 risk alleles, we performed EMSAs with
nuclear protein extracts of HepG2 and MIN6 cells. Probes
carrying either allele of each of the three SNPs were used.
No DNA-protein complex could be detected with MIN6 or
INS-1E extracts (data not shown). In contrast, DNA-protein



TABLE 3

Association between PROX1 SNPs and glucose-related traits in the HELENA study

                                FG (mmol/L)              FI (μIU/mL)             HOMA-B                  HOMA-IR
SNP       Alleles               Effect          P value  Effect        P value   Effect        P value   Effect        P value
          (effect/other)
rs340875  C/G                   -0.010 (0.016)  0.54     -0.76 (0.31)  0.016     -6.96 (4.29)  0.022     -0.12 (0.06)  0.040
rs340874  C/T                   -0.009 (0.016)  0.56     -0.48 (0.31)  0.096     -4.80 (4.25)  0.11      -0.09 (0.06)  0.19
rs340873  A/G                   -0.014 (0.016)  0.38     -0.88 (0.31)  4.3E-03   -7.23 (4.34)  0.013     -0.15 (0.06)  0.015
rs340872  T/A                   -0.015 (0.016)  0.37     -0.88 (0.31)  4.3E-03   -7.23 (4.34)  0.013     -0.15 (0.06)  0.015
rs340839  A/G                   -0.009 (0.016)  0.58     -0.78 (0.31)  0.011     -6.96 (4.28)  0.014     -0.13 (0.06)  0.021
rs340838  T/A                   -0.013 (0.016)  0.42     -0.90 (0.32)  2.95E-03  -7.25 (4.36)  7.7E-03   -0.16 (0.07)  7.1E-03
rs340837  T/G                   -0.013 (0.016)  0.41     -0.92 (0.31)  2.1E-03   -7.50 (4.32)  6.0E-03   -0.17 (0.06)  5.4E-03
rs340836  T/C                   -0.014 (0.016)  0.39     -0.89 (0.32)  2.7E-03   -6.93 (4.38)  9.8E-03   -0.16 (0.06)  6.8E-03
rs340835  A/G                   -0.005 (0.016)  0.74     -0.69 (0.31)  0.040     -4.88 (4.30)  0.06      -0.11 (0.06)  0.093

Data are per-allele effect (SE). P values were adjusted for age, sex, BMI, and center. Significant P values (after correction for multiple testing)
are indicated in boldface. SNPs reported are 1) the top hit SNP associated with FI levels (rs340837), 2) SNPs in high LD with rs340837, and 3)
the GWAS top hit SNP (rs340874).
FIG. 2. Impact of nine PROX1 SNPs on luciferase activities in MIN6 and HepG2 cells. MIN6 (A) and HepG2 (B) cells were transiently transfected
for 48 h with either empty pGL4-minP vector or pGL4-minP-PROX1 vectors. Firefly luciferase activities were normalized to the Renilla luciferase
activities. Activity of the construct is expressed as fold activity compared with empty pGL4-minP vector. Data are mean ± SEM of three
independent experiments. *P < 0.05 (Mann-Whitney nonparametric test).



complexes were observed for the rs340874, rs340873, and 
rs340835 probes with HepG2 extracts (Fig. 3). Each DNA- 
protein complex was specific because it was inhibited by 
specific unlabeled oligonucleotide probes. Allele binding 
differences could be detected for the rs340874 SNP only, 
with a 60% higher affinity for the rs340874 C allele than 
for the T allele. We conducted an in silico search with 
Genomatix software to examine whether the SNP modified 
any TF binding sites, but no putative TF was described at 
rs340874. Two TF binding sites for rs340873 and rs340835 
could be detected, but neither seemed particularly relevant 
in type 2 diabetes (Table 4). 

Prox1 and GSIS. To investigate the possible consequence
of a lower expression of Prox1 in β-cells and to assess
whether Prox1 could play a role in GSIS, we knocked
down Prox1 expression by siRNAs in INS-1E cells. After
transfection with Prox1 siRNAs or NT siRNAs for 48 h,
INS-1E cells were treated for 1 h with 3 or 30 mmol/L
glucose. Knockdown of Prox1 expression was verified at
the mRNA and protein levels. Cells transfected with Prox1
siRNAs showed 40% less Prox1 mRNA (data not shown)
and less protein (Fig. 4A) than cells transfected with NT
siRNAs. Only 40% inhibition of Prox1 could be
obtained despite the use of 1) the optimal conditions
based on the cyclophilin B positive control siRNA data and
the Prox1 siRNAs and 2) a pool of four different Prox1
siRNAs. Of note, Prox1 siRNAs had no effect on cyclophilin
B mRNA expression levels. After the cells were
stimulated with either 3 or 30 mmol/L glucose, insulin level
was measured in the supernatant. Knockdown of Prox1
mRNA expression levels resulted in a 1.7-fold decreased
GSIS in Prox1 siRNA cells compared with NT siRNA cells
(P = 0.029) (Fig. 4B), indicating that Prox1 stimulates GSIS
in β-cells in normal conditions.
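
The molecular comparisons in this study, including the GSIS result above, rely on the Mann-Whitney nonparametric test. The sketch below shows how an exact two-sided P value can be obtained for small groups by enumerating every possible assignment of ranks. It is an illustration only: the study used SAS, the function names are hypothetical, and the example values (four per group) are made up rather than taken from the experiments.

```python
from itertools import combinations


def midranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=values.__getitem__)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = average
        i = j + 1
    return ranks


def mann_whitney_exact(x, y) -> float:
    """Exact two-sided P value of the Mann-Whitney (rank-sum) test."""
    n, m = len(x), len(y)
    ranks = midranks(list(x) + list(y))
    expected_sum = n * (n + m + 1) / 2.0
    observed = abs(sum(ranks[:n]) - expected_sum)
    total = 0
    extreme = 0
    # Every way of choosing which n of the pooled ranks belong to group x.
    for idx in combinations(range(n + m), n):
        total += 1
        deviation = abs(sum(ranks[i] for i in idx) - expected_sum)
        if deviation >= observed - 1e-9:
            extreme += 1
    return extreme / total


# Made-up, fully separated groups of four values each.
p_four = mann_whitney_exact([1.0, 2.0, 3.0, 4.0], [5.0, 6.0, 7.0, 8.0])
# The same situation with three values per group.
p_three = mann_whitney_exact([1.0, 2.0, 3.0], [4.0, 5.0, 6.0])
```

With complete separation, only 2 of the 70 possible rank assignments are as extreme as the observed one for four values per group, and 2 of 20 for three values per group.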

Regulation of Prox1 expression by glucose. Because
type 2 diabetes is characterized by hyperglycemia, we investigated
whether the expression of Prox1 could be regulated
by glucose concentration in INS-1E cells. These
cells were cultured for 24 or 48 h in medium containing 3
or 30 mmol/L glucose. Total RNA was extracted, and
Prox1 mRNA level was quantified relative to that of TFIIB.
High concentrations of glucose (30 mmol/L) repressed
Prox1 expression to ~20 and 45% (both P = 0.029) of the
expression at 3 mmol/L glucose at 24 and 48 h, respectively
(Fig. 5A). As a positive control of the treatment (22), L-PK
mRNA expression was quantified and, as expected, was
induced approximately sevenfold (P < 0.05) on exposure
to 30 mmol/L glucose (data not shown). To check whether
a high glucose concentration would also modify Prox1
protein levels, nuclear protein extracts were prepared
from the same samples (Prox1 protein being too weakly
detected in total protein extracts). In contrast to the
[Gel image: rs340874 (C and T alleles), rs340873 (A and G alleles), and rs340835 (A and G alleles). Lanes for each allele: DIG-labeled probe alone, probe with nuclear extract, and probe with nuclear extract plus unlabeled probe.]

FIG. 3. EMSAs of PROX1 SNPs. The DIG-labeled probes carrying alleles of rs340874, rs340873, and rs340835 were incubated with reaction
mixtures containing no nuclear extracts or 5 μg of nuclear extracts from HepG2 cells. Specific competition was assessed using 50-fold molar
excess of unlabeled probes.



mRNA level, at the protein level, a high concentration of
glucose upregulated Prox1 expression by ~20 (P = 0.20)
and 50% (P = 0.11) at 24 and 48 h, respectively, relative to
the values at 3 mmol/L (Fig. 5B).



DISCUSSION 

To our knowledge, this study is the first to 1) test the
impact of the whole common genetic variability of PROX1
(80 SNPs) on type 2 diabetes-related traits in adolescents,
2) analyze the sequence activity of the associated
SNPs in HepG2 and MIN6 cells, and 3) evaluate the role
of Prox1 in GSIS in β-cells. In the HELENA study, three
PROX1 SNPs (rs340838, rs340837, and rs340836) were significantly
associated, and two SNPs (rs340872 and rs340873)
tended to be associated, with plasma insulin levels. Consistently,
the rs340838, rs340837, and rs340836 SNPs tended
to be associated with HOMA-B and HOMA-IR. Haplotype
analyses did not add any further information (data not
shown).

The results are in line with those previously reported in
adults and adolescents. The MAGIC study (5) reported
significant associations between the C allele of rs340874
and higher FG level (effect size +0.013 mmol/L, P = 6.6 ×
10⁻⁶) and type 2 diabetes (OR 1.07, P = 7.2 × 10⁻¹⁰)
and nominal associations with lower HOMA-B (effect size
-0.008, P = 0.020). Unfortunately, this study did not assess
the impact of the PROX1 rs340838, rs340837, or rs340836
SNPs (or proxies) (MAGIC consortium, July 2011 personal
communication). The meta-analysis on children and adolescents
by Barker et al. (8) showed that the C allele of
rs340874 was associated with higher FG levels (effect size
+0.013 mmol/L, P = 0.042) and, to a lesser extent, with
lower HOMA-B (effect size -0.016, P = 0.067). Conversely,
in the present study, we were unable to detect a significant
association between rs340874 and FG levels; the sample



TABLE 4

TF binding sites identified by Genomatix software at the rs340874, rs340873, and rs340835 SNPs

SNP position  SNP       TF creation    Family/matrix name   Start      End        Strand  Core        Matrix
                        or disruption                       position   position           similarity  similarity
212225879     rs340874
              C→T       None
212227152     rs340873
              G→A       Lost           V$GATA/GATA1.03      212227149  212227161  +       1.00        0.959
              G→A       New            V$HOXF/HOX1-3.01     212227150  212227168  +       1.00        0.871
212230298     rs340835
              G→A       New            V$BCDF/PTX1.01       212230288  212230304          1.00        0.940
              G→A       New            V$RXRF/PXR_RXR.01    212230289  212230313  +       0.79        0.802
[Panel A: Western blot of PROX1 (84 kDa) and RAN (25 kDa) in NT siRNA- and Prox1 siRNA-treated cells. Panel B: insulin secretion by glucose concentration (mmol/L) in NT siRNA- and Prox1 siRNA-treated cells.]

FIG. 4. The siRNA knockdown of Prox1 reduces the glucose-stimulated
insulin secretion in INS-1E cells. A: INS-1E cells were transfected with
control NT siRNAs or Prox1 siRNAs. The effects of siRNAs on Prox1
expression were measured by Western blot analysis 48 h later. RAN
expression was used as a reference. The results represent at least three
experiments. *P < 0.05 (Mann-Whitney nonparametric test). B: INS-1E
cells were transfected with control NT siRNAs or Prox1 siRNAs for
48 h and then stimulated with 3 or 30 mmol/L glucose in Krebs buffer
for 1 h. Secreted insulin was measured by ELISA. Data show relative
insulin secretion obtained for Prox1 siRNA- relative to control NT
siRNA-treated cells at 3 compared with 30 mmol/L glucose. The results
represent at least three experiments. *P < 0.05 (Mann-Whitney
nonparametric test).



[Figure 5 image. x-axis: glucose (mmol/L), 3 and 30; time point label: 24 h.]




FIG. 5. Glucose represses Prox1 expression at the mRNA level but
enhances Prox1 expression at the protein level. INS-1E cells were
precultured for 24 h in RPMI medium before addition of 3 or 30 mmol/L
glucose and were incubated for 24 or 48 h. After cells were harvested,
total RNAs and proteins were extracted, and expression of Prox1 was
quantified by real-time PCR (A) or Western blot (B) analysis. The data
were normalized to TFIIB (A) or RAN (B) expression. The results
represent at least three experiments. *P < 0.05 (Mann-Whitney
nonparametric test).



size was not large enough to detect a difference of 0.013 
mmol/L (power 12%). Nevertheless, the present results 
agree with these meta-analysis data because carriers of the 
rs340874 glucose-raising allele had lower HOMA-B than 
carriers of the other allele. It is noteworthy that rs340874 is 
in modest LD with the rs340838, rs340837, and rs340836 
SNPs (r² ≈ 0.60). The results indicate that this group of
SNPs has a larger effect than rs340874 on insulin-related 
traits in adolescents. Replication studies in larger samples 
are needed to confirm these findings. 
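
As a rough illustration of the power statement above, the following minimal sketch approximates the two-sided power of a single-SNP additive association test using a normal approximation. It is not the calculation used in the study. The function name `snp_power`, the minor allele frequency, and the SD of fasting glucose are hypothetical placeholders; only the sample size (n = 1,155) and the effect size (0.013 mmol/L) come from the text.

```python
from math import sqrt
from statistics import NormalDist


def snp_power(n, maf, beta, sd, alpha=0.05):
    """Approximate two-sided power to detect a per-allele effect `beta`
    on a quantitative trait with residual SD `sd`, in `n` unrelated
    individuals, under an additive model (normal approximation)."""
    std_normal = NormalDist()
    # Variance of the allele count (0/1/2) under Hardy-Weinberg equilibrium.
    genotype_var = 2.0 * maf * (1.0 - maf)
    # Expected z-statistic of the regression slope (effect / standard error).
    expected_z = abs(beta) * sqrt(n * genotype_var) / sd
    # Two-sided critical value for the chosen significance level.
    z_crit = std_normal.inv_cdf(1.0 - alpha / 2.0)
    # Probability that the test statistic falls in either rejection tail.
    return std_normal.cdf(expected_z - z_crit) + std_normal.cdf(-expected_z - z_crit)


if __name__ == "__main__":
    # maf and sd below are assumed values chosen only for illustration.
    print(round(snp_power(n=1155, maf=0.45, beta=0.013, sd=0.40), 3))
```

With a null effect (`beta=0`) the function returns the significance level itself, and power rises with sample size, which is why replication in larger samples is the natural next step.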

We next tried to identify the functional variant responsible
for the association signal. Although the strongest association
with insulin levels was observed for rs340837,
other SNPs in strong LD with rs340837 could explain the
statistical association. Using luciferase reporter assays in
HepG2 and MIN6 cells, we show that the insulin-lowering
alleles of rs340873, rs340835, and rs340874 are associated
with lower luciferase activity in both cell lines (except for
rs340874, for which the effect was seen in HepG2 cells only),
especially rs340835 (around fourfold lower), suggesting that
PROX1 expression could be reduced in hepatocytes and
β-cells of individuals carrying these alleles. The observed
association signal, therefore, could be explained by one of
these three functional SNPs, even if they are not themselves
statistically associated with glucose-related phenotypes
(because of a slightly lower MAF and lower effect size).
Although the whole regulatory and genomic context was not
assessed with 60-bp inserts, this method helped to identify
functional effects of isolated SNPs. The specific effect of
rs340874 in HepG2 cells suggests the activity of liver-specific
regulatory elements at this site. EMSA indicated that some
TFs are able to bind to the three SNPs in HepG2 cells, with
some allele affinity differences for the rs340874 SNP.
Genomatix could not detect alterations at rs340874 in
putative TF binding sites. In the pancreas and, more
specifically, in β-cells, nothing is known about the
regulation of PROX1 expression. Other experiments,
therefore, are needed to determine the exact mechanisms
underlying the reduced expression of PROX1.

We show that Prox1 plays a role in GSIS in pancreatic
β-cells because the inhibition of its expression was associated
with a nearly twofold reduction in GSIS. As a TF,
Prox1 could regulate the expression of genes involved in
the insulin secretion pathway. Because Prox1 is known to
be expressed in endocrine islets in late gestation embryos
and in adults (11), subtle modifications in its expression
caused by one or several SNPs may mediate alterations in
GSIS and lead to type 2 diabetes. This result is in accordance
with the GWAS data showing that the majority of
loci associated with type 2 diabetes point to primary
defects in the β-cell (4).

Finally, we show that high glucose levels downregulate
Prox1 mRNA expression but upregulate Prox1 protein
expression in INS-1E cells. Although consistency between
mRNA and protein expression often is assumed, examples
of divergent trends are frequent. Jayapal et al. (23) showed
that >30% of the analyzed genes in Streptomyces coelicolor
exhibited divergent patterns and that nearly one-third of
them showed opposing trends. In mice, divergent trends were
observed, for example, for IL-18 in a model of cutaneous
wound repair (24). It is not known which posttranscriptional
mechanism (improved Prox1 mRNA stability, increased
translation efficiency, etc.) is responsible for this effect.
This situation may also reflect negative transcriptional
feedback that prevents an excess of Prox1 protein.
Nevertheless, glucose appears to modulate Prox1 expression
in β-cells.

In conclusion, the rs340838, rs340837, and rs340836
SNPs in PROX1 appear to play an important role in the
regulation of insulin secretion at the population level.
These SNPs probably tag functional cis-regulatory SNPs,
such as rs340874, rs340873, and rs340835, that are associated
with lower reporter gene expression in vitro. We
also show that Prox1 plays a role in GSIS in pancreatic
β-cells. Although the exact mechanisms are not completely
understood, the present results bring us a step closer to
a better understanding of the involvement of PROX1 in
type 2 diabetes.